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fT^ , A decentralized search algorithm is a method of routing on a ran- 

dom graph that uses only limited, local, information about the real- 
ization of the graph. In some random graph models it is possible to 
define such algorithms which produce short paths when routing from 
any vertex to any other, while for others it is not. 
i-G ' We consider random graphs with random costs assigned to the 

C^ , edges. In this situation, we use the methods of stochastic dynamic 

programming to create a decentralized search method which attempts 
to minimize the total cost, rather than the number of steps, of each 
path. We show that it succeeds in doing so among all decentralized 
search algorithms which monotonically approach the destination. Our 
[^ ■ algorithm depends on knowing the expected cost of routing from every 

f~^ , vertex to any other, but we show that this may be calculated itera- 

VO ' tively, and in practice can be easily estimated from the cost of previous 

^^ I routes and compressed into a small routing table. The methods applied 

^^ ' here can also be applied directly in other situations, such as efficient 

^^ , searching in graphs with varying vertex degrees. 
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• ^ ; 1 Introduction 

X 

H 

C^ I Jon Kleinberg introduced the concept of decentralized search algorithms in 

his celebrated 2000 paper on the Small- World phenomenon [1]. In particular, 

he showed that in certain random graphs it is possible to find paths between 

vertices of poly-logarithmic length even when limited to using only local 

knowledge at each step, while in others it is not. 

Most of the by now large canon of work in the area (see [2] for a recent 
survey) has been dedicated to finding and analyzing algorithms that route 
between two given vertices in a small number of steps. Typically, the best 
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Figure 1: If the goal is to reach vertex 9 in the fewest steps, then certainly 
vertex 1 should choose the long edge to vertex 7. However, if the goal instead 
is to minimize the total cost, routing to 2 might be a better choice. 

method in these situation is greedy: progress to the neighbor which is closest 
to the destination. In this paper, we consider a generalized situation where 
the cost of passing down an edge is not fixed, but may be a random variable 
whose value is known when the choice of where to proceed to next is made. 
The goal is then the minimize the cost of reaching the destination, which 
may lead to different priorities when routing (one may not wish to route to 
a vertex very close to the destination if the cost of passing down that edge 
is very high, see Figure [1]). This is a problem similar to that applied to 
time process in the field of stochastic dynamic programming P], and we use 
similar methods. 

The basic idea is this: If a vertex knows the expected cost of each of his 
neighbors routing to the destination, and also the cost of him routing to 
each of his neighbors, it makes sense for him to choose as the next step that 
neighbor which minimizes the sum of these two costs. While the expected 
costs are difficult to calculate analytically, we find that there is little rea- 
son to do so. When many queries are performed, one may start by using 
any guess as to these values, and then update these guesses based on past 
experience. 

We will show analytically that this form of search is well defined, and that 
it is optimal among algorithms that monotonically approach the destina- 
tion, as well as presenting some results on the order of the total cost as 
well as approximation trade-offs. We continue with a discussion and some 
experimentation on the practicality of the approach. Finally, we apply the 
algorithm with greater generality, and see that it performs well also in these 
cases. In particular, we note Ithat it can be applied to routing with non- 
homogeneous degree distributions. 

1.1 Previous Work 

The original work on decentralized search was done by Kleinberg in [1] and 
[3]. Much work has since been done on related problems, in particular 
further generalization of the results and improvements on the bounds (see 
[5] [6] [7] [8] [9] for some examples). 



In a bid to improve the performance of Preenet [TO], a decentralized peer- 
to-peer network, Clarke [11] proposed an algorithm in some ways similar to 
that presented here under the name "NG Routing". The method was im- 
plemented in Freenet at the time, but has not been used since the network 
was re-engineered to route according to the method described in [12]. We 
believe that the technical and architectural problems experienced by previ- 
ous versions of Preenet were unrelated to what is discussed below, and, in 
light of our results, that Clarke's ideas were fundamentally sound. 

§im§ek and Jensen also proposed an algorithm based on the same principles 
in [13], intended for routing based on both vertex degree and similarity. 
Their "expected-value navigation" is based on the same idea as our cost- 
greedy search, however they use a rough estimate of the expected routing 
time, which cannot be applied to our problem, to make the decisions. They 
present no analytic results. For our take on search in graphs with variable 
node degree, see Section 16.21 below. 

For an introduction to the field of stochastic dynamic programming, see the 
monograph by Ross [3]. 

1.1.1 A note on terminology 

The terminology regarding algorithms for decentralized path-finding in ran- 
dom graphs has not yet settled, and different authors have used different 
terms. We have chosen the term "decentralized search" (following recent 
work by Kleinberg [2]) but others terms have been used to describe the same 
thing. "Navigation", used in e.g. [5] [H], is quite common, but perhaps not 
as descriptive. We avoid formally calling our algorithms "routings" as this 
name has previously been used in computer science literature to describe 
flow assignments through a graph with limited edge capacities (for example 
[15|). a different problem from that currently studied. However, we do use 
the terms "route" and "to route" following their dictionary definitions. 

1.2 Organization 

In Section [2] we set out the basic definitions of decentralized search, as well as 
rigorously define our new "cost-greedy" search algorithm. Following this, in 
Section [3] we prove the main results regarding cost-greedy search in networks 
with sufficient independence. In Sections S] and [S] we set out the methods 
for applying the results in practice, and, finally, in Section [6] we perform 
simulated experiments to look at the actual performance of the algorithm. 



2 Definitions 

A cost graph G = {V^ E, C) is a graph consisting of a vertex set V, a possibly 
directed set of edges E, and a collection of costs, C. For each element 
{x,y) € E, C{x,y) = C{{x,y)) is an i.i.d. positive random variable giving 
the cost of traveling down that edge (time taken) . 

G may be a random graph, by which we mean that for each x,y £ V, there 
exists a random indicator variable E^^y saying whether there is an edge from 
X to y. These may be dependent and differently distributed. 

Definition 2.1. For a given cost graph G, a z-search for a vertex z £ V, 
is a mapping A : V >-^ V such that: 

1. A{x) = y only if (x, y) S E. 

2. A{z) = z. 

3. For all X £ V there exists is k < oo such that A^{x) = z. 

A search of G is a collection of z -routing algorithms for all z dV. 

We call d a distance on aset V if d : V xV i-^ M+, if for x,y G V, d{x, y) = 
implies that x = y, and if for z,y,z € V, 

d{x,z) < d{x,y) + d{y,z). 

A distance is thus a metric without the symmetry requirement. In particular, 
any connected digraph G implies a distance do- For x G V, N{x) = {y £ 
V : dcix, y) = 1} is the set of neighbors of x in G. 

Definition 2.2. A distance d is adapted for search in a connected graph 
G if for every x,z G V, where x ^ z, there exists a y a N{x) such that 
d{y,z) < d{x,z). 

A distance function d is thus adapted for search if it, in some sense, reflects 
the structure of G. The most obvious example is of course do itself, but d 
may also be, for instance, graph distance on any connected spanning sub- 
graph H of G. Another important case is that if 1/ is a set of points in a 
metric space, then the space's metric is adapted for search in V's Delaunay 
triangulation (see [9]). 

Definition 2.3. Given a cost graph G, and a vertex z, a decentralized 
search is a z-search A, such that for any x & V the random, variable A{x) 
is measurable with respect to: 

1. Er,y for all y eV. 



2. C{x,y) for all y € N{x). 

Intuitively, this means that as well as any information about the graph 
model, routing at x may use information about which vertices x does (and 
does not) have edges to, as well as the costs of passing down those edges. 
The definition of decentralized search as originally given by Kleinberg was 
slightly broader than this, allowing a route started at a vertex x to use all 
the information from A^{x),A^{x), . . . ,A{x) when taking its A;-th step. 
Because our analysis will be restricted to algorithms meeting the following 
criteria, excluding this information will make little difference: 

Definition 2.4. Given a graph G, a distance d adapted for search in G, 
and a vertex z, a forward search is a z-search T such that for all x € y\{z} 

d{T{x\ z) < d{x, z). 
For a given search A and vertex z 

S'{x;A)=mi(k:A''{x) = 
is the number of steps it takes to reach z from x using A. Let 

T^{x-A)= Y. C{A'-\x\A\x)) 

i=l 

which is the cost. 

2.1 Greedy and Weighted Greedy Search 

In the following, we fix z. Greedy search is given by 

Ag{x) = argmin(d(y,z)). (1) 

y&N{x) 

This is always a decentralized search, and if d is adapted, than it is a forward 
search (if d is not adapted, it may not be well defined to begin with). 

Standard greedy search does not take the costs G into account. A variant 
that does, is weighted greedy search. For a given z, let Wz{x) for x G F be 
a collection of weights, where Wz{z) = 0. These weights specify a search 
algorithm 

A^{x) = argmin(C(x, y) + Wz{x)) (2) 

y£N{x) 

which may or may not be well defined. If we restrict this to being a forward 
search, we get 

Tw{x) = argmin {G{x,y) + Wz{x)) (3) 

yGN(x) : d{y,z)<d{x,z) 



which is always a well-defined if d is adapted for routing in G. 

The behavior of weighted routing depends on w. If t(;(x) is strictly increasing 
in d{x, z) then Aw and J^„, are Ac when the costs are constant. If w{x) = 
for all j; G y, then JP"^ will simply choose the edge approaching z with the 
lowest cost. Since we are trying to minimize the cost of routing, it makes 
sense to see Wzix) as a guess of the cost of reaching z from x. Imagine that 
we are given a black-box function /, where 

f{x,A) = E[T^{x;A)] 

that is, / tells us the expected cost of routing from x using any given algo- 
rithm A. If a vertex x knows that all its neighbors will use the algorithm 
A to route, it makes sense that it should want to use the y given by ^ (or 
([3]) if restricted to forward search) with Wz{y) = f{y,A). Extending this 
reasoning to any vertex motivates the following definition. 

Definition 2.5. A forward search T is called cost-greedy search for z if 
T = Tw, where the weights w are given by the solution to the equations: 

w{x) = B[T^{x;J^u,)] xeV (4) 

Equation H] is what is the routing equivalent of what is known in stochastic 
dynamic programming texts as the optimality equation. We will show below 
that a solution exists in this context, that this solution is a globally attractive 
fix-point, and that cost-greedy search is optimal with respect to expected 
routing cost for all forward searches. 



3 Results for Independent Graphs 

Let d be an distance, and y -^ x denote the event that there is an edge 
from y to X (we allow multiple edges per pair). We let G be a random 
graph model on which d is adapted for routing (with probability 1), and 
for which y,x,v,u E V with d{y,z) > d{x,z) and d{v,z) > d{u,z), y -^ x 
is independent of w ^^ u if y ^ v. Examples of such graphs are adding 
outgoing edges from each vertex with destinations chosen independently (as 
in Kleinberg's work [T]) or allowing each edge, either seen as directed or 
undirected, to exist independently of all others (like in classical random 
graphs and long-range percolation [H]). We call a graph constructed in this 
manner edge independent. 

Theorem 3.1. For an edge independent random graph G, there exists a 
solution w to equation ^ so that cost-greedy search is well defined, w is a 
globally attractive fix-point of the iteration given by 

Wi+i = nT'{x;Fwd]- (5) 



Proof. Define the rank of x witli respect to z, rz{x), as tlie position of x 
when ah the elements in y\{z} are ordered by increasing distance from z, 
using some deterministic tie-breaking rule. Let rz{z) = 0. 

We will proceed by induction on rz{x). 

Let wq be any weighting. Let x & V such that rz{x) = 1. For any forward 
search T, J-{x) = z, whence ^\T^{x;J-)] does not depend on the algorithm. 
In particular E[T^(3;; J^w)] is does not depend on w, whence Wi{x) is constant 
for i > 1. 

Let rz{x) = /c, and assume that for all y € F such that rz{y) < r^^x), Wi{y) 
takes the same value for all i > A; — 1. This means that E[T^(x; JF^. )] takes 
the same value for all i > k. 

It follows that for all x € ^ Wi{x) is fixed for i > rz{x). Hence Wk = w^^i 
for all k > n — 1, and w = Wn is a solution to (jl]). D 

Theorem 3.2. Let G be an edge independent graph, and J^ a forward search 
for a vertex z £ V . Then for all x ^V 

where Ta, is cost-greedy search for z as in Definition^. 



Proof. Use the same definition of rz{x) as in the proof of Theorem 13. li Like 
there, we will use induction on rz{x). 

If rz{x) < 1, then all forward searches from x are the same, and there is 
nothing to prove. Let J- be any forward search. Given x £ V, assume that 
for allyeV such that r,(y) < r,(x), B[T'{y;J^^)] < B[T'{y;T)]. 

Let V = T{x) and v* = Tt{x), the places the respective algorithms choose 
as the next step. Below, we mean by "local knowledge" that which decen- 
tralized algorithm may use, as given in l2.31 We note, crucially, that because 
of our assumptions, T^{v]J-) is independent of local knowledge at x, while 
C{x,y) is measurable with respect to it. 

E[T'^(x; J^)] = E[E[T'^(x; J^) I local knowledge at x] ] 
= B[D{x,v) + E[T'{v;T)]] 
>B[D{x,v)+E[T'{v;J'^)]] 
>B[D{x,v*) + -E[T'{v*;T^)]]=E[T'{x;T^)] 

where the first inequality follows by induction since r^(f) < rz{x) and the 
last because the expression inside the first expectation is what v* minimizes. 

D 



3.1 The Small- World Graph 

A particular example graphs meeting the criteria of the last chapter are the 
small- world augmentations first introduced by Kleinberg [1]. This construc- 
tion starts with a fixed finite grid H, letting d = dn, and creating G by 
adding a random outgoing directed edge from each vertex x to destination 
a y with probability 

P{x-^y)cxl/dG{x,yr (6) 

d is naturally adapted for routing in G. 

For simplicity, we let H he a ring of n vertices (Kleinberg originally used a 
two-dimensional square lattice, but the proofs are identical). Let G{n,a) be 
the family of random graphs so constructed. 

Using previous results about greedy routing on such graphs, we can calculate 
the cost order of cost-greedy search. In particular, we can see the order in 
n cannot be different from greedy routing. 

Theorem 3.3. (Kleinberg) If G & Gin, a) with a = 1 there exists Ni such 
that for n > Ni, 

E[S'^{x;Ag)] <kilognlogd{x,z) 
where ki is a constant independent of x, z, and n. 

Further results about such graphs, proved in [5] and [ITj respectively are 

Theorem 3.4. (Barriere et al.) If G ^ Q{n,a) with a = 1, then there 
exists N2 such that for n > N2 , 

¥^[S^{x;Ag)] > k2lognlogd{x,z) 

where ^2 is a constant independent of x, z, and n. 

Theorem 3.5. (Singh Manku) If G ^ Q{n,a), with a>0, then 

B[S'ix;AG)]<nS'ix;A)] 
for any x,z £ V{G) and decentralized search A. 

Together, these allow us to prove the observation that 
Proposition 3.6. If G £ G{n,a) and < E[C(x,y)] < 00 then 
E[T'{x;Tn,)] = e{lognlogd{x,z)) 



Proof. The upper bound comes directly from Theorems 13.21 and 13.31 since 

E[T'ix;Tn,)]<E[T'{x;AG)] 

= B[C{x,y)]B[S'{x;AG)] 

< E[C{x, y)]ki log n log d{x,y). 

where the middle equality follows from the fact Ac routes independently of 
the costs, and the simple form of Wald's equation. 

To prove the upper bound, consider all edges in the graph as directed, letting 
the edges of the H be denoted by double directed edges. Since a forward 
search can only ever traverse an edge in one direction, this does not affect 
its cost. Now let 

R'{x;T)= Yl C^niH^r^) (7) 

where Cuim{x) = min{C(x,y) : y G N{x)}. This counts, at each step, the 
minimum cost of any outgoing edge, rather than the cost of the edge which 
was actually used. 

Since the degree of each vertex is fixed, Cmin(a^) is i.i-d. for all x. Let Si be 
the (T-algebra generated by all the information seen in steps 1, 2, . . . , i of the 
search (as listed in Definition 12. Sp . Note: 

• C'min(-^*'''^(2;)) is independent oi Si. 

• Sz{x;A) is a Stopping Time with respect to the filtration {Si}^^. 

Thus we may use Wald's Equation to conclude that 

E[i?^(x;.F)] =E[C^i„(x)]E[5^(x;.F)]. (8) 

We now use the immediate fact that R^{x;J^) < T^{x;T), followed by ([8]) 
and Theorems 13.51 and 13.41 to conclude 



B[T^x;J^)] >B[R'{x;T)] 

= B[C^i^{x)]B[S^x;T)] 

>B[CraUx)]B[S'{x;AG)] 

> E[Cmin(x)]fc2lognlog(i(x,z) 

for sufficiently large n. Since this holds for any forward-search T, it holds 
in particular for J^yj. D 

Proposition 13.61 tells us that in this model, the order of cost-greedy routing 
will not be different from that of greedy routing. The proof of the lower 
bound assumes, however, that E[C(a;,y)] < oo and that the degree of each 
vertex is bounded as n grows. Neither of these things, and particularly not 
the latter, necessarily hold in applications. 



3.2 Approximated Weights 

We consider the situation when the solution iD to ([S]) is not known exactly 
but approximated by another set of weights. 

Proposition 3.7. If w is the solution to ^ and w another set of positive 
weights such that 

max 1^(3;) — u;(x) | < e 

xGV 

then for any edge-independent graph of size n 

B[T'{x; T^)] - E[T'{x; T^,)] < 2ne (9) 

and more generally, for any /c > 

E[r^(x; T^)] - E[r^(x; T^i,)] <2e{k + nP(5"(x; .F„,) > k)) . (10) 

Proof. Let err^(x) = E[r^(x; jf^^)] - E[T^(x; JSi,)]- It follows that 

E[T^(x;^^)] = E[C;(x,.F^(x)) +E[r^(J-^(x);.F^)]] 

= E[C7(x,^^(x)) +E[r"(.F^(x);^^)]] +err^(J-^(x)) 

= E[C(x,J^^(x)) +w{Tu,{x))\ +eTTyj{Tu,{x)). 

Now, since by the definition of a weighted greedy search C(x,F^(x)) + 
w{!Fw{x)) < C{x,y) + w{y) for all y G N{x) 

< B[C{x,T^{x))+w{T^i,{x))-w{T^{x))+w{J'^{x))] 

< B[C{x,Tu,) + w{T^{x))] + E|w;(^^(x)) - w{T^{x))\ 

+ E|w(:^^(x)) - wiJ^wix))] + err^(J^u,(x)) 

< E[r"(x; J-^)] + 2e + err^(.F^(x)) 

It follows that for any k > 

err^(x) < 2ek + err^(.F^(x)). (11) 

li k > S'^{x\Tyj) then T^{x) = z and err^(.7^^(x)) = 0, so ([9]) follows since 
n > S^{x; A) for all searches. 

To prove (|1U|) . note that by the same reasoning 

err^(^^(x)) = err^(^^(x) | .F^(x) / z)P{S'{x;T^) > k). 

Since the graph is edge independent err^(^^(x) | JF^(x) 7^ z) is simply the 
error from some point which is not z, but where ([9]) still applies. D 
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What the proposition, and in particular ()lip says is that if an approximation 
w oiw is off by e, then each step in the routing adds at most 2e to the optimal 
routing time. This is intuitively clear, since while Tw may choose the wrong 
vertex in a given step, it can only do so when the total (actual) cost of 
routing via that vertex is within 2e of the cost of routing via the real one. 
For the same reason, it is unlikely that a better bound can be achieved 
without further assumptions on the graph and the cost distribution. 



4 Calculating the Weights 

Theorem 13.11 provides us with a method of calculating the weights w for 
cost-greedy search. One can start by assigning any initial weighting wq, and 
then calculate wi,W2, ■ ■ ■ using ([5]) . 

A closed analytic form for 'E[T^{x;J-'w)] as a function of the vector w is 
probably very difficult to find, even in the most simple situations. One can 
note however that it can be written recursively as 

E[T^(x; T^)] = Y, (E[r^(x; T^)] + E[C{x, y) \ T^{x) = y]) V{:F^{x) = y) 

where the sum is over all y € y such that d{y,z) < d{x,z). In the 
very simplest cases (such as a directed loop with one augmented outgo- 
ing shortcut chord per vertex) it is possible to calculate P(.7\„(x) = y) and 
E[C(x,y) \J^w{x) = y] analytically, in which case E[T^{x;Tw)] can be cal- 
culated numerically by recursion. Because this is complicated, and unlikely 
to be of much interest in practice, we do not linger on it. 

A much more rewarding strategy is to calculate the weights empirically. 
That is, start by simulating a large number of searches from randomly chosen 
points using Two- While this is being done, sample the average routing cost 
to z from each vertex (due to the Markovian nature of forward search on 
an edge independent graph, a vertex may take a sample every time a query 
passes through it). After a sufficient number of queries, the average should 
be an estimate at 'E[T^{x;Two)] by the law of large numbers. One may 
then take the average costs from each point as wi, and continue in this 
manner. Proposition 13.71 indicates how close an approximation is needed, 
but unfortunately it is not strong enough to derive a rigorous bound using 
a polynomial number of samples. 

Further, we note two things about the sampling implemented. Firstly, one 
needs to be careful about the way the repeated queries are done. Since we 
want the edge costs to be random, C{x, y) must be picked anew, indepen- 
dently, for each query sampled. If the graph is random, the edges may be 
redrawn, but must not - it simply depends on whether they are to be seen 
as random or fixed edges in the G above. 
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Secondly, the proof of Theorem 13 . 1 1 guarantees convergence in n steps, mean- 
ing that an optimal routing is achieved once Wn has been calculated (if an 
empirical method is used, the resulting weighting may still suffer inaccuracies 
due to the sampling). This is an unfortunately large number of iterations, 
especially given that each may require simulating a large number of queries, 
but we find that in practice, much fewer iterations (typically two or three, 
even for very large networks) are needed, see Section [B] below. 

5 Practicality and Decentralization 

We proceed to discuss actual applications of Definition 12.51 On the face of 
it, the routing method described does not seem particularly practical. Even 
if we can calculate the weighting w, this gives a routing table of size of n, 
and such a table is needed for each z we wish to route for. The complete 
table of weights needed to route between any two vertices is thus of size n^. 

Several assumptions can help here however. 

Centralization: Translation Invariance 

If we assume that the graph is translation invariant, then 'E[T^(x;J-)] = 
E[T^ {x — z; J-)] so X needs only know the routing cost from each starting ver- 
tex to a distinguished vertex 0. In fact, in many cases (such as the common 
case of augmenting single cycle with random outgoing edges) E[T^(a;;^r)] 
may be exactly, or at least approximately, a function of d{x,z), in which 
case X need only know the expected cost of routing a given distance. This 
knowledge is the same for all x, so may be calculated as a single, global, 
vector. 

Decentralization 

If one wishes for a completely decentralized search system, as, for instance 
in peer-to-peer systems such as jlUj . then one cannot store a global vector 
of weights. Instead, each vertex must store the weights needed to route to 
every other vertex. In particular, each vertex x needs to be able to calculate 
Wz{y) for each y G N{x) and z (^ V. If one assumes translation invariance, 
X need only store one such weight vector, and can translate it to apply to 
its neighbors. Without such invariance, it needs to store |-/V(2;)| vectors. 

Weight Vector Compression 

In both cases above, however, we are still left with a routing table size of 
at least n, which is definitely not desirable. The heart of what makes our 
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Figure 2: Rather than storing every value of w vector (the dotted hne) as a 
routing table, we store values at exponentially increasing positions, and use 
these to approximate the values between them. 

method practically useful comes from the fact that the previous theory about 
decentralized search makes compression to a logarithmic size possible. 

If we consider graphs of type G{n, a) described in Section [3. 11 we know from 
Proposition 13.61 that 

E[T'{x,Ti,)] « cE[C]log{n)log{d{x,z)). 

The utility of this is that if we know that E[T^(x, J^t)] grows logarithmically 
with d{x,z) (as indicated by Proposition 13. 6p . we are motivated to assume 
that it, and thus the weights w in Definition 12.51 can be approximated 
by assuming w{x) and w{y) have similar values if x and y are such that 
log(d(x, z)) ^ log{d{y, z)). In particular if < d{y, z) — d{x, z) < r we get 



\w{y) — w{x)\ ^ ciE[C] log(n) 



d{x, z) 



(12) 



It is easy to prove, using the same methods as in the proof of Theorem 13.3 
that for greedy routing in Q{n, a) 



¥{S'{x-AG)>\og^n)<C2 



logn 



n 



Assuming that a similar bound holds for Tw , equation (jlOp with k = log n 
in Proposition 13.71 gives that 

E\T''(x;Tuj)] -E[T''(x;Tyj)] < 4max |^Z)(x) - it;(x)| log^n 

so if r < ed{x,z) /4:\og n in (fT2]) . then 

E[r^(x;.F^)]-E[r^(x;.F^)]<e. 

Thus, the weight w{x) can be substituted by the weight of a vertex ed{x, z)/ log'^ n 
steps from x. To do this, we divide the routing distances into zones of size 
2* for i = 0, 1, 2, . . ., and record only the weights of log n/e evenly spaced 
vertices within each zone (Figure [2]). The routing table thus contains a 
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Figure 3: The steps taken by greedy and cost-greedy search when all edge 
costs are fixed to 1. In theory, the latter should converge to the former 
(which is optimal) but due to the inaccuracy of the estimates and the 
routing-table compression cost-greedy here performs about 5% worse than 
the optimum. 



polylogarithmic number of entries (0(log n)), and yet by using the clos- 
est recorded weight as a substitute for w^x), we incur only an e-error on 
the total routing cost. Proving this rigorously, however, depends on tighter 
bounds then Proposition 13.61 or even Theorems 13.31 and 13.41 provide. 

We will see experimentally in Section [6] that a routing table of size around 
E[S^{x; Ag)] works well in practice, both when using a single vector and in 
a decentralized system. 

While it may seem like a limitation that this will only work on graphs where 
routing in a logarithmic number of steps is possible, those are likely to 
account for most situations where decentralized search is of interest. Beyond 
Kleinberg's small-world model, other cases where decentralized routing is 
expected to take a logarithmic number of steps are hypercubes (where the 
hamming distance is adapted for routing), and Chord networks [18] (where 
the circular distance is). 



6 Experiments 

6.1 Direct Applications 

We start by simulating the algorithm under the most basic conditions. We 
let G consist of a single directed cycle of n vertices, augmented with logn 
outgoing shortcuts from each vertex, according to Kleinberg's small-world 
model. That is, each shortcut from x is to an independently chosen vertex 
selected with according to ([6]) with a = 1, which in this case translates to 
l/hnd{x,y), where d is distance in H, and hn ~ logn is a normalizer. 
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Figure 4: The cost and steps taken by cost-greedy and greedy search on 
networks with Exp(l) distributed costs along each edge, plotted against the 
size of the network. Networks consist of directed rings with log2 n directed 
shortcuts per vertex. 
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Figure 5: The cost and steps taken by cost-greedy and greedy routing on a 
network of size 262144 with Exp(l) distributed costs along each edge. Cost- 
greedy performance is plotted against iteration of the system in Theorem 
13.11 starting with all zero weights. 
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We start by assigning wo{x) = for all x G V, and calculate the expected 
routing times by simulating 20n queries between randomly chosen points 
(this number of iterations is probably excessive). We re-sample the costs 
for each query, but the graph is kept the same. However, because only 
one sample vector of the expected routing times over each distance is kept, 
we still end up marginalizing over the shortcuts. We use the logarithmic 
compression oiw described in SectionO(in practice, we find this outperforms 
using a full w vector except when an extremely large number of queries is 
simulated), and use wiq as an estimate of the final value. 

The difference between cost-greedy and standard greedy search in terms of 
query cost depends crucially on the distribution of C{x,y). Quantitatively, 
it is possible to make the benefit of cost greedy as large (or small) as one 
wishes by a strategic choice of this distribution. For example, if 

^ , , I 2 with probability i 
C{x,y) = < ^ 

I otherwise. 

then cost-greedy search will most often incur zero cost assuming the vertex 
degree is large enough (as will a simple lowest cost routing). It would thus 
be dishonest of us to claim that our methods are motivated based on the 
performance achieved with delays chosen by us. The experiments in this 
section are thus meant to verify that cost-greedy search behaves as expected, 
rather than to illustrate its benefit: the potential benefit of the algorithm 
must be evaluated for every particular situation where it may be applied. 

Our first experiment, shown in Figure O is thus to see what happens if 
we fix the costs to 1 for all the edges. In this case cost and steps are the 
same, and since it is known (Theorem 13. 5p that greedy search is optimal in 
the expected number of steps, the theory tells us that cost-greedy should, 
ideally, give the same value. In fact we find that it under-performs by about 
5% in all the sizes tested - presumably due to the empirical estimate of the 
expected value, and the losses due to the logarithmic compression of the 
weight vector. 

Figure m shows the performance of cost-greedy when the costs are exponen- 
tially distributed as a function of the graph size. We choose an exponential 
distribution simply because it is a common model for waiting times, and the 
mean of 1 means that the cost and steps of a route are of the same scale. 
We see, as expected, that cost-greedy search is able to produce routes that 
cost less by taking more steps than normal greedy search does. In Figure 
Owe plot performance for a single network size against the iterations of (0) 
when starting with all zeros. We see that even in a network of hundreds of 
thousands of vertices, no measurable performance is gained after the fourth 
round - supporting our hypothesis that convergence is a lot faster than the 
bound given above. 
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Figure 6: Example of a network with two different vertex types - one tenth 
of the vertices are augmented by 55 outgoing edges, whereas the rest get 
just 5. All edge costs are exactly one, and the values are averaged over four 
simulations to decrease variance. "S J Method" is the method of §im§ek and 
Jensen described in Section [6l2l 



6.2 Out-degree Distribution 

Another question that has been asked about navigability is how to route in 
a network if the vertices have variable degree, and if the degree as well as 
the position of the neighbors is known when the routing decision is made. 
This problem is motivated by the nature of social networks, which appear to 
be navigable, but where it is known that the vertex degree follows a heavy- 
tailed power-law. This problem is in many ways similar to that which we 
discuss above: like with the edge costs, degree distributions may incentivize 
away from a pure greedy strategy, and instead call for a trade-off between 
getting close to the destination, and other factors (in this case, wanting to 
route a vertex with high degree). 

§im§ek and Jensen |13j have studied this problem by simulation. Their 
method is fundamentally similar to ours: they also seek to choose the neigh- 
bor which minimizes the expected number of steps to the destination (as we 
do if the costs are fixed to a unit value). However, rather than attempting 
to calculate the fix-point of the weights, they make a rough approximation 
of the value using the right-hand side of the inequality 



E[S'{x)] > P{S'{x) > 1) = P(z ^ N{x)) 
to estimate the left. 



(13) 



To apply our methods above to the problem, we let the weights be a function 
not only of the distance to the destination, but also of the degree of the 
vertex. Since this question related only to the number of steps, we fix all 
the costs to 1. We expect the weight for any particular distance to be smaller 
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Figure 7: The same situations as in Figure HI but now including the results 
when using a separate weight vector at every vertex in the manner described 
in Section 16.31 A round is 20 x n simulated queries as above. 

for vertices with higher degree (since the amount of ground gained in the 
first step should be better). 

Figure El shows a simple example of this. In that case we have exactly two 
possible out-degrees: a few (O.ln) of the vertices have 55 shortcuts, while 
the rest have just 5. We compare the cost-greedy search as used above with 
regular greedy search and the method of §im§ek and Jensen. The results 
seems to vindicate the approximation used in the latter method, with this 
distribution it slightly out-performs cost-greedy search, meaning that the 
numerical losses in estimating the true weights are greater than the analytic 
loss of the approximation. This seems to be the case for most sensible such 
distributions, we find that cost greedy search only takes a slight lead when 
the popular vertices have more than a hundred times the degree of unpopular 
ones. If we presume that cost-greedy search can come within 5% of being 
optimal also here, we are forced to conclude that so does the SJ method. 

One advantage that cost-greedy search has over the method of §im§ek and 
Jensen, is that their method requires detailed knowledge about the model 
in order to calculate the right-hand side of (|13|) . which cost-greedy search 
does not. Algorithmically, any vertex may implement cost greedy search for 
its queries, and it needs only have the ability to to measure the cost of the 
queries it sends to its neighbors, nothing more. 

6.3 A Decentralized and Generalized Implementation 

To look at the practical viability of the algorithms described above, we also 
simulate a completely distributed variant. In the decentralized variant, we 
equip each vertex with its own weight vector, measuring the mean cost of 
routing from it to destinations at varying distances. Like before we use a 
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Figure 8: Using a separate weight vector at every vertex in the manner 
described in Section [631 Costs are Exp(l) distributed, and out-degrees are 
distributed according to a power-law with tail-exponent 2. "SJ Method" is 
the method of §im§ek and Jensen described in Section 16.21 



log2 n compression of the weights - coalescing all distances between 2 and 
2*:+i [^iq ii^Q same entry - but unlike above we do not calculate each weight 
by sampling over a fixed number of queries. Instead, we let the weight vector 
at each vertex x be calculated as the mean of the entries in a FIFO buffer, 
showing the cost of the last m queries x has routed destined for vertices 
of every distance category. As before we do not change the edges of the 
graph between the queries from which the weights are estimated. Because 
no marginalizing is occurring here, the graph model is actually one fixed 
realization for each size - the expectation is actually taken only over the 
costs. 

To route a query, x uses the weight vectors of each of the vertices in N{x) 
to minimize ([5]). In a real world implementation, these values could be peri- 
odically copied between neighbors. One problem we find with this method 
is that if X initially has m queries in a certain distance category that incur 
a very high cost, he will not attract more queries in that category from his 
neighbors (who see it as very costly to send such queries to him), meaning 
it takes a long time to clear the errant values from the FIFO buffer. Even- 
tually the buffer will be replaced, if not otherwise then by the cost of the 
queries initiated at x itself, but in our simulations we find that this slows 
the convergence. To alleviate this, we keep a count of the number of queries 
X receives for each distance category during an interval. The theory says 
that these should be equal, so if one of the counts has fallen a lot behind (is 
less than a quarter of the queries x receives for itself) we set all the values 
in the buffer to 0. 

Even with this method, the convergence is, as expected, slower than in the 
centralized version. Figure [7] shows the equivalent of Figure H] but using 
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local weight vectors at each vertex. Here we let m = 20. We can note 
three things: there is an absolute performance cost of the decentralized 
version, the cost seems to get worse for larger sizes, but it still considerably 
outperforms greedy search. The first is probably due to each estimate of 
the expected routing time being based on far fewer values, while the second 
is due to us not simulating enough queries for full convergence at the large 
sizes, as seen by the increasing difference between the values as 10 and 20 
rounds. We note that even 20 rounds is actually only 400 queries initiated 
at each vertex - a large number when we must simulate it for a quarter of 
a million vertices, but very little compared to the number of queries one 
would expect in most DHT's or other deployments of distributed networks. 

Finally, in Figure [8] we use the decentralized method to route in a situation 
when we both have exponential edge costs, and vertices of varying out-degree 
(in this case a power-law with P(|A^(x)| > i) ps t~'^). Decentralized cost- 
greedy search can optimize both for varying costs and vertex degrees at the 
same time. 



7 Conclusion 

We have presented a method for decentralized search that takes varying 
costs of routing down different edges into account. We have showed that this 
method is optimal among all such algorithms that monotonically approach 
the destination of the query, and that the necessary weights can be calculated 
iteratively. On small-world graphs, we can calculate the order of costs, and 
say something about the approximation cost. Beyond these analytic facts, 
have presented a number of techniques which make the algorithm practical, 
and experimented with actual implementations using simulation. 

It would be very desirable to be able to better motivate our approxima- 
tions rigorously. To do requires strengthening propositions 13.61 and 13.71 and 
perhaps a lot of work beyond that. In the short term, proving that any 
polylogarithmic routing table, and any polynomial number of samples, is 
sufficient would be a big improvement. 

The long term goal of these studies is to try to find adaptive methods for 
decentralized search when the edge costs are not independent random values, 
but depend, for instance, on the number of queries that have passed down 
the edges recently. Such methods for routing with congestion are of interest 
to deployments of peer-to-peer and other distributed systems. 
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